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INTRODUCTION 

The increasing frequency of nosocomial infections due to 
methjculm-resuaant Staphylococcus aureus (MRSA) and van- 
annycm-resistanl. Enterococcm faecium (VRE) and the fear 
that toga-level vancomycin resistance wffl eventually spread to 
staphylococci underscore the need for vigilance in the continu- 
ing war against pathogenic microbes (18, 39). Current widely 
used antibiotics „ * fl 

vital ceBular functions: cell wall, DNA, RNA\ and proTiabio- 
nutem (Table 1), and instances of resistance affl^S 

,v u «? T T 1 ^ 6 ^ flDd WeD documented (48). Thus, there 
lmle dD ^ =^w antibiotics are needed to combats 
growmg problem of antibioticresistant bacteria, and terse tin* 
of new pathways will likely play an importable in 
l^ nSW « tfltaM « a -> feet, a number of crucial cellular 
pathways, such as secretion, cell division, and many metabolic 
junctions, remain untargeted today. In the last 3 years, high- 
tbToughput automated random genomic DNA sequenring to- 
£5S! 7 r0blUt ^Smrat assembly .tools has delivered a 
• wealth of genomic sequence information to assist in the search 
lor up targets. In many cases, entire biochemical pathways 
can be reconstructed and compared in different pfthogeS. 
The purpose of this nunireview is to indicate where this infor- 
mation can be found, to outline some of the ways in which it 
can be used, and to describe new tools to take advantaged 
genomic sequence information in the drug discovery process. 

Each potential new antibiotic must meet a number of crite- 
ria before it ^-approved for use, and the choice of an appro- 
priate target is the first atep in this process. It is helpful to 
review the utility of genomic information with regard to wme 
of the key criteria which antimicrobial targets must meet In 
general, (1) a target should provide adequate selectivitv and ' 
spectrum yielding a drug which is specific or highly selective 
.grot the microbe with respect to the human host but X 

Srt .P Z B P tania of pathogens; (ii) a target 

should be essential for growth or viability of the pathoeen^t 
least essential under conditions of infection; and (iii) som" 

assays and high-throughput screens can be built. Identification 
of potential new targets can proceed from any one of these 
criteria, but ultimately all must be met by a successful target 
For example, a variety of methods may be used to find eenes 
which are essential for the survival of an organism undlr de- 
fined conditions or which are necessary for infectivity in an 
animal model. Comparative genomics may be used to identifv 
potential targets which are shared across multiple microbial 
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t * P f ma ^ y SBqUenCB siaaarity based, may 
h n lZL pn f 01 fu ? c * ,n oimost g° nes » thai spetific 
pathways can be targeted. As discussed below, genomic se- 
quence information provides assistance in all of these areas- 
selectivity, spectrum, functionality, and essentiality (Fig. 1). 

CURRENT RESOURCES FOR GENOMIC SEQUENCE 
• AND FUNCTIONALITY INFORMATION 

Numerous databases are now available which contain both 
sequence and functionality information. Most of these are sc 
cessible over the Internet through convenient Web browser 
mterfflces. Many also permit downloading of sequence infer- 
nation for use on local servers. Sequence databases now cos- 
tern the nucleotide and predicted amino acid sequences of 
virtually every gene m the model microbes Escherichia col ' 
Bacillus subtdis. and Saccharomyces cerevisiae as well as in a 
variety of other bacteria .(Table 2; aversion of this table is 
updated regularly by The Institute for Genomic Research 

SSte 7 v b SItB: hc ? : /^-tiST.org/tdb/mdb/mdb 
.html). These databases are the result of extensive analysis 
of the genomicsequences of those organisms. Open reading 
names have been analyzed by sequence comparison and to 
codon usage to ,dentify those which are mostHkely to reprei. 
sen tmnscribed genes. Putative functions have been assigned 
10 slightly more than half of the genes in the model organisms 
based on sequence comparisons to genes of known function 
in other organisms, shared sequence motifs, or clusterine of 
£££ CM J^ll elated familieE ' Databases such as EcoCyc 
KBGG, and WTT present these datain an organized and 3 
manner (see Table 3). w 
^Recently, some commercial databases have also become 
n» by commercial subscribe™. 
These databases, generally also provide sequence mfoimation 

analysis tool? for convenient analysis of the data. For example 
«nJf f °f P reru ?, sequence similarity searches may be 
stored to provide rapid answers to complex comparative geno- 
mic queries by a subscriber. Finally, several Wet-accessible 
tes offer userul tools for sequence analysis via »q2£ 

E«l? eBrt ? K i moTif T searcheE ' *** structural compariso^. 
Examples of relevant Internet sites providing databases of se- 

^es^mS 0 ^ inf0nnatiM Ud t00k - 

=J^ 6 *^ advan f 6 m microbial genomics will be the avail- 
abihty of the comp ete genomic sequence from multiple strains 
of a single bacterial pathogen. The discovery of genes con- 
served in multiple pathogenic strains or me recognition of 
genes found only in the most virulent strains are examples of 
the power such genomic comparisons wffl provide. Sequence 
for a second strain of Helicobacter pylon has appeared and 
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TABLE 1. Gene targets of widely used antibiotics 



Target category and . 
gene product 



Antibiotic class 



Protein Byntheris 
305 ribocomal robunii., 
5DS ribasomal submit, 
tRNA IU synthetase- 



«.AxxunDglycoside£, tetratychnes 



Elongation factor G., 



-.Ivdflcrolidefi, chionanphcoicol 
— Mupirocin 
- Jasidic arid 



Nucleic acid synthesis 
DNA gyraae A subunir; topo- 
isomenise* TV 



DNA gyrnre B subunli- 



SNA potymenst beta subi 
DNA „ 



..Qumolonef 
..Novobriaric 

-i Ri'fRrqp fn ' 

-Metronidazole 



Cell wall peptidoglycax) synthesis 
Transpeptidases 
D-AJa-D-AJa ligase substrate 

Antimetabolites 
Dihydrofolate reductase. 



3 eta -1 actains 
™.Glycopeptides 



Dibydropteroatc synihssu . 
Tatty acid synthesis-™™ 



■M> M .TrimeChoprini 

Stllfrm at^iri^ p 

Isoniazid 



sequence for a second strain of Mycobacterium tuberculosis wiD 
appear soon (Table 2). 

COMPARATIVE GENOMICS TO ASSESS THE 
SPECTRUM AND SELECTIVITY OF A TARGET 

One powerful use of genomic sequence information is to 
compare all of the identified genes in different, bacterial patiio ' 
gens to determine which genes are, or are not, snared by var- 
ious species. Indeed, Tatusov et aL (50) have suggested that 
gene families conserved among bacteria but missing from eu- 
karyotes comprise a pool of potential targets for broad-spec- 
trum antibiotic development. An early step in this direction 
was taken by Musbegian and Koonin (36), who identified 
256 genes shared by the two completely seguenced bacterial 
genomes at that time, those of Haemophilus influenzae and 
Mycoplasma genkalium. On the other hand, genes which are 
apparently unique to a species such as H. pylori might be ideal 
for targeting that species with a narrow-spectrum antibiotic As 
the number of sequenced bacteria] and fungal genomes grows, 
so does the ability ro find genes common id most microbial 
pathogens or truly unique to a particular species. For example, 
Arigoni et al. (6) identified 26 genes in R cok\ most of which 
were conserved in the B. subtiks, M. genitalium, R influenzae, 
H. pylori^ Streptococcus pneumoniae, and Borrelia burgdorferi 
genomes. They reasoned that this list of genes, which had no 
predictable function, contained novel targets for broad-spec- 



trum antibiotic development. These analyses can be extended 
by including sequence comparisons to eukaryotic genomes as s 
means to examine potential selectivity of a target (50). For 
example, Arigoni et aL (6) reported that 15 of 26 proteins 
broadly conserved across bacterial species also exhibited sig- 
nificant sequence similarity to proteins in S. cerevisiae and, 
therefore, represented targets which, in an assay, might iden- 
tify compounds that also have human toxicity. "While tbest 
targets could simply be avoided, it should be noted that the 
• targets of the majority of marketed antimicrobial agents show 
some conservation with mammalian proteins. 

As in all sequence comparisons, the search parameters and 
the quality of the input data, e^g., partial human or mammalian 
sequence infonnation, are critical Relevant issues which must 
be addressed include questions such as the roBcwiag. ,, What 
degree of sequence ffirnflariry to another bacterial genome 
indicates a shared gene 7 What degree of sequence simflariiy to 
a mammalian gene warns of a possible toxicity problem? Since 
sequence similarity-searching algorithms allow nearly com- 
plete flexibiliry in the choice of these parameters, some known 
examples are necessary to calibrate the method. Mushegian 
and Koonin (36) used a BLAST? score of 9D as the cutoff for 
defining a biologically relevant relationship between two pro- 
tein sequences. The appropriate cutoff score for exclusion of 
genes with apparent mammalian homologs may be more gene 
' specific. Some examples reveal a general trend Trimethoprim 
is a highly selective inhibitor of- bacterial dihydrofolate re- 
ductases (DHER) despite the fact that the human and R col 
DHER gene products share 28% amino arid identity over the 
length of the two proteins (40). Similarly, the atiinolones are 
highly selective against bacterial gyrases despite the fact that 
the C^ternomal domain of human topoisomerase II shares 20% 
amino acid identity with R coli gyrase A (25). Fluconazoles are 
highly selective for fungal lanosterol 14-a demethylases s even 
though the human and yeast gene products share 37% amino 
acid identity oyer their full length (5). These sequence identity 
percentages translate into BLASTP scores of 132, 125, and 
301, respectively, in" a search of a large nomedundant protein 
database comprised of sequences from GenBarik, SwissProt, 
and PUc Therefore, •exclusion of genes having apparent mam- 
malian homologs with scores >150 would likely be suitable for 
a search of bacterial targets, but the score cutoff would have to 
be raised to allow identification of the broadest set of antifun- 
gal target genes. 



ID ENTBTI CATION OP ESSENTIAL TARGETS 
EXPERIMENTALLY 

Genomic sequence mformation is not required for discov- 
ering essential genes, but such information does facilitate the 
process, Genes which are essential to pathogenesis and prevent 



CompaiaiivB QenansicB 
Selectivity &. Spectrum 



Function*] 
Pcmurrs 



Validation of 



Validation of 
Expresrioa 



Assay & 
HTS . 




Sequence similarity 
Motif content 
Protein iairfflrrion 
OperDn neighbors 
Structural similarity 



DurctsiJ allele exchange 
Rjurdcsi cause oscd- based? 

QsiiBiic fooninntrnn 

GAMBIT 

STM 
Conditional lcihnls 
VITA 



Micro-array 

hybridization 
RT-PCR 
IV ET 
DH 



Biochemical assays 
Whole-cell assays 
Lismii-afTinily osfu&vb 



rts^ctiv^iy SChCmnUC ° f gEn0miC t00i£ &PPii&d W anlimicrobia) - dni g discovery. See the ten for details, G+ve and G-ve, gram positive an 6 gram jjegstive, 
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Internet resource 

ww.tigr.org/tdh/mdb/bidb/hJdfaJitniJ 
ww.ti^.nj^tdb/tedb/n^dh/mgdbJi^ 
ww.ngr.c^tzibAndb/n^dij/mjdi^iiiiiii 
wwwJcaxusa^jp/rynno/cyanoJmnJ 
wwarihiLUDi.heJdfilbeEBJc/M meumDoiae/ 

MF_HtnneJitml . 
*P»EfK.m^tocJ!em4iipg.te 

_ggnD Tnr .. hTmlT or gcnome-wwwjtcnford 
■ .odu/Sarciiaroraycw 

www.tigr.nrgrtdb/mdb/bpdb/iipdb±nni 
www.geaciics.wisc.ttiu/ 

www^eiuunecorpxoffl/geiift/BapieneeV^ 

mf . thnTio bnctei/iihairncUmnJ 
www.pasteur^/Bia/SubtlLisLbtmJ 
www.tigr.org/tdbAndb/flfa b/aftib Joimj 
www.tigr.org/tdb/mdbAbdh/bbrib.iitniJ 
www.ncbiTilm.nfh gDv/cgi^la/Eolrcg/ 

framik7db** Geoomo&giw 133 
www.niojjit&ga.jp/ot3iJb_indexhiinJ 

www^gcr^axii/ProjecDS/M tuberculosis/ 
www.if " ** ' ' " ' - ~ 



evoiutioiUnnBaULtt/^/gnonucj^^ 
www.gononiecorpxoin/ijFylori or www^atro- 

boKDn.eom/b pylori 
wvvw.dgr.Drg/dg-bm/BJaHtScnrcb/biast 

.cgi? organism « i ni i _tnbereui osi£ 



TABLE X Sequenced microbial genomes 



StraiD(ti) 



Haemophilus i^uenme RD • KW20 
Mycoplasma genuauum G-37 
M ethanococua jannaschil DSM2fiSl 
Synechccystb up. PCC6B03 
Mycoplasma pneummuac M129 

Saahavmycu ceratee S2B8C 



Behcobatier pybri 
Escherichia coU 
Mtthanabactcrmm therms 

aumtrophiaum 
Bacillus sublilis 
Archaeoglobus fuigidus 
Borrelia burgdorferi 

Pyrvaonus fwrikashii 073 



Mycobacterium tuberculosa 
Treponema pallidum 
Chlamydia trachomatis 

Rickettsia protoaieJdi 
Helicobacter pylori 



25695 
K-12 
delta H 

163 

VC.16.DSM43M 
B31 
VH 



H37Rv 
Nichols 

ScrovarD p/UW3/Qc) 

Madrid £ 
JB9 



Mycobacterium tuberculosis CSU#93 



Size 
(Mb) 



1*3 HGR „ 

ass hqr j; 

US TTQR • g 

3.57 Kn^DNAReie^iostote 27 

0.B1 Uaivendry of Heidelberg 23 

33 European and North Americau 27 
Consortium 



IM TTOE 

4J UnJveisby of Wisconsin 

1.75 Goose lienpewto and Ohio 

Sma Vimmity 

4 2 IntrniBtioLoJ Consortium 

2.18 TIOR 

1.44 TIOR 

155 Drveira 



51 
7 
43 

31 
29 
14 
10 



1MB National Iaafnite of Technology 28 

' and Evaluation 

4-4D Snngor Crane g 

U4 TIOR end Unfvcrehy of Toms 16 

US Univcniiy of Caffiomia at Bedce- 46 

lsy and StBnfurt Unreality 

Ul Vnheatcy of Uppsala 4 

1M Gonorijt Therapeutics end 3 

Astra AB 

11031 tbpubtoted 



colony formation in a conditional-lethal manner are DOtentkl 
• SKf T Thfc assumes tblfa K 

- ganic molecule which inhibits the activity of an esseiSene 

productwould eitherkill or inhibit the growth of tbSbSffi 

genes can be discovered throngh classical mutagenesis tech- 
niques Availability of the sequence of the genome IKS 
the M sequence of each mutated ^^fcS 
cellular role a, can be gleaned from a short sequSsS 
on a complementing plasmid insert. Tim additional Mom*. 
^Jfr^* 6 P r ° ceSEin E of a xnutational s^nct 
SS&P ^ eadH ^ 011 the . avafla t3i«3' of genetic toolsfor tie 
microbial speaes in question, a variety of molecular genetic 
method, can be used to discover essential genes. For £ 

«™ ° f "* instructed tranTposon^ 

am (11), or genes ran be mutated to a Mnditior^-lemal form 
In principle, such conditional mutants can be used m whok-' 

the cells may be hypersensitive to drug-like compounds Which 
act against that gene product (see below) """swnicn 
It seems reasonable, to assume that most genes which are 
essential to the cell for growth or viability on fiXSSeS 
will also be required for growth or viabffity in an toSSSf 
Experimentally, media can be varied in order to EStSi 
which are essential under the widest range of g3? ££ 
toons and particularly in rich media which may K2tTS£ 

S^V*"^ t " UB of 811 Cells cirS 

au^tropkc mutations may find sufficient nutritional s3? 
ment in the lost ussues to permit growth or at least MrShaL 
Such genes might be poor targets for new antimicrobials unless 
t mb}3S \ ^ Particular nutrient is m shott 
supply a lie host or that cells are incapable of transporting &t 
uutnent efficiently. In order to estabush that JKk 
cssentaal in an infection, a transposon-based gtne ufglng 



method called "signature-tagged mutagenesis" /STM has 
5» », W Senes which are essential ii aTS£ 

S ( ' 35) " *"* ^ CEn yi n B the disruS 

tagged genes must be grown in the laboratory prior to mmC 

S^V 1° 1116 aaim ^; he ^ be bSsed aS 

S ta^S ^^^forgMboth on laboratory S 
and man annual modeL Indeed, many of the genes' identified 

■ 5-S M -5P pMr u t0 eSCOde 7inilBnce which affectlhe 

ability of the pathogen to colonize or damage hort^ rather 
to the vtability of the patnogen. New drugs w^ichi^^ 

m ?Z P " CESEM £1? P rove seS, and Set 
to such drugs might be rare since loss or mutation ofthe 
virulence factor would also likely reduce virderacTHoTO 
otter resistance mechanisms, such as drug 
efflut pumps, could be problematic. In addition, the absMceTf 
a convenient a vitro assay for such drugs would Cper the 

STJ ntl teStmS ' " d P^es. Ir reS 2 

clear how many important antimicrobial targets would be 
rmssedby using as targets for drug discovery only toe genes 
wbciare essential for growth orviability 01 Saborato? «K 

A related, important feature of a suitable antimicrobial sens 
W » « «pression pattern in the mfection^Ihe ab !lm 
level of expression may be less important than.' informal 
about whether it is expressed at an. A hS^xeSed ab2 
f^oduct should be no more difficult » SJtSal 

high affinity win be efiective in either case- unless it is poor v 
taken up by pathogens. However, if a gene is not expressed u 
aj in an established infection of an arinalhosTlw^lb 

2 « ? ? Dttlltial ^ A "toa^ estabushed 

as being essential for growth or viability in the labomtoryby 

ditions because us failure to be expressed as an a-ctive product 
causes the pathogen to die. Knowledge that s^VtSS 
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gene is also expressed in an animal model would sureest that 
itm essential in an infection as well. Two types of methods ofier 
Herniation about gene expression. First, for genes whose se- 
quence is known, reverse transcriptase PCR CRT-PCR) may be 
used to detect transcripts in ceils grown on agar media or in 
animal infection models (47). Alternatively, for - organisms 
which have been sequenced- in their entirety, a whole-genome 
vjew of gene expression may be obtained by gridding clones 
PGR products, or synthetic oligonucleotides representing ev' 
ery gene onto a solid support Total RNA maybe isolatedfrom 
ecus grown under conditions of interest, labeled, and hybrid- 
ized to the array (12). While thorough, this iype of method 
suffers from some problems: (i) appropriate controls must be 
run to ejinunate the possibility of bacterial DNA cemtamina. 
tion m the RNA preparation, (if) probes are difficult to prepare 
because bacteria] ulRNA is notoriously unstable, and Cm) I the ■ 
whole-genonuc scale of the experiments makes the arrayed 
membranes difficult and expensive to prepare and read. A 
genetic promoter trap method termed "in vivo expression tccb- 

* m f£ rl ^ b , C m ° rc fBafiible for mQSl ^oratories 
£ TV 33) ' „ B ^approach, which has been developed for use in 
Salmonella typhimuriurn grown intraperimneally in BALB/c 
mice or m cultured macrophages, random DNA fragments are 
cloned upstream from a gene whose expression' is required for 
growth in an anima] host Cells, which multiply in vivo, are 
recovered and cloned The sequences of fragments serving as 
functional promoters in vivo are then determined. A Becond 
related promoter trap method termed "differential miores^ 
cenoe induction" pFI) has been described recently (53). The 
&stmpishing features of this approach are that fi) the gene 
used for selection encodes a modified green fiuorracenr pro- 
tein and I (a) the selection is accomplished with a fiuoresceace- 
acnyated cell sorter. If such methods can be extended to other 
bacterial species and animal hosts, they will be extremely use- 
ful for assessing random genomic fragments or specific genes 
of interest for expression in vivo. 

IDENTTFICAIION OF ESSENTIAL TARGETS 
USING DATABASES 

Potential gene targets selected from databases can be Vali- 
dated ^ examining the effect of a gene knockout on cell growth 
or viability. Recombination is almost exclusively between ho- 
mologous regions in bacterial genomes, and many common 
pathogens as well as model bacteria are transformable Ex- 
change between the chrpmbsomal wild-type allele and a ver- 
sion engineered to carry a deletion and/or an insertion of a 
drag raoBtuuu cassette is generally efficient enough to be ' 
practical id the laboratory. Interpreting the results ofsuch an 
experiment, however, may be difficult for two reasons, First 
the frequent occurrence of porycistronic messages in bacteria 
means that disruption of a gene may have a deleterious effect - 
on expression of a distal neighboring gene, a so-called "polar* 
effect In that case, the inviabfljry caused by a gene knockout 
could be-due to loss of expression of a gene other than the one 
disrupted. Precautions can be taken to reduce theseefrects by 
for example, including a moderate-strength outward readine 
promoter m the disrupted version of the allele so as to permit 
expression of the downstream gene(s). Second, the method 
works better as an exclusionary tool than as an inclusionarv 
one While success in generating a cell carrying a disrupted 
allele indicates that the gene is not essential for growth or 

,T? ^ Ccl1, fai3ure 10 S ener aie such an altered cell 
could be due to any one of multiple causes including polar 
effects or inefficient recombination in a particular genetic 
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TABLE 3. Additional Internet resources 



Database or orgnnixan'oi) 



Imanict ndtircny 



Sequence database* 
NCBI 



EBI/EMEU 
G5DB. 



SwioProt (Geneva).. 



HDP. 
SOB- 



—http://www.Hghi.rt |fn nih.gQV/Egfrez/ 

Gaaomc/orgJinnl 
n ttpV/www.da'bjjii^aajrVb tmi£_ test/ 
Welcome-eJmaJ 

^ttp^Avww.n^.oi^g£db"yiiidex_gsdbJiiniJ 
~httyifhkri rnrri ifma-edn/CandldiLhttni 



->http^MpUIfc«iihic.cdu/ 



Metabolic databases 
KEGC 
£coc 



WIT- 



~ Jj tn?V/w\vw,geriDm£aid.jp/kcgg/ 
Jittpy/ecocyaJ^gaaSystcniixoni/ 
cEOcyn/ccoeycJinaJ 
■Jnqa^WffwxTBcmini r'aVWIT/ 



Sequencing groups 
Berfceiay. 



Genome Therapeutics . 

Sanger , 

Stanford*. 



71GK 

University of Oiiahon 

University of Queensland. 
Unrvuuiy of "Washingtira . 

Washington University, 



Tools and resources 
Biomolecukr Research Tools . 

COGs™ 

NCG*. „ 



MAGPIE- 
Genobase.. 



htrp'iAvww^enamecaipxom/hoTntJit] 
ittp^Avww^gcr^c.nk/ProjeciE/ 

"TtpV/tequexieMvwjtanfordeo^gzom^ 
malaria/in dexnonl 

hrrpy/dnal.chcm.uoknoT.ndu/mriexiitmJ 
lttoV/Wwwxincb.UflxdiiJiu/aerugiDosa/ 
ittpV/cnirncni.hTnrrrb,wagbiagTOD.finu/ 
uwgc/ 

^.JittpV/g&iiQnu.Yw^dx^c/b&steiiaV 
saimDnoilaJnral 



« Jit&^/www.poblicjastatB^ du/— pedro/ 
rt_ LhtmJ 

~bup'Jjwww jjeblalnuiln.gov/COG/ 
^ttp^/www.ncgr.oTg/rnicrobc/Iridei bus 
.htm! 

ttrn!//www.mrt.n T1 j gnv/fa Dxne/gaasrerl/ 
gsnomssJitml 



Micro Underground* 



ANMR. 

WHO 

Paliea_ 



^t^/Epectcx.dm.nib.gDv:S004/ 
htra v'/wwwisuioc. edu/cainpiis/mj cr/nnrxur/ 
pub licji tzaifmd exJrnnl 
^trn/Mw.wdcmxikBa.go .jp/ 



University of Kansas. 

University of Georgia— 

Tripos -— 

Motif — ^ 

PodanL. 



.^ap^/wpWiWhoxb/WdcomeJjDii] 
— btn^y/www.oiirw j2c.uk/~rh. bmODlAetbook/ 
chnpter.ntatf 

inpi//WwJijmcxdii/rac Etrcb/fgsc/niaiij 



— btrp^/uingiis.geasiics.ugL- esdnSOBty 

httpy/www. aipas-com/a i res .htm! 



Gc3jTHRBADER„ 



Jin^^/dim^toiifbnLEDU/i dentiry/ 
~-bttp^^danLraip.biocberDjmpg.de/ 
M J:ttr^/gJobiiLbiD.waiT?icLa.&uk/gcnoino/ 
fenomiebnn] 



One solution to this problem is to carry out allele exchange 
as a two-step process (20, 32). In K coU t for example, the dis- 
rupted aUele together with the vector enrjying it can be 

^ S^ 0mB . by mB£ms of a e croBsover, 
a so-called Campbell insertion." Recombination between ho- 
mologous regions on the two copies of the allele: now on the 
chromosome will eliminate the vector sequencers and either 
copy oi the allele Which copy is eliminated depends upon 
which regions of homology were involved in th& recombina. 
tion. Failure to find cells retaining only the disrupted allele 
strongly suggests that such progeny are inviable:. Success in 
nnding cells retaining only the wQd-type allele confirms that 
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recombination is efficient in this genetic interval TTrmim*. • 

2 SSy compelMt ^^^T^. 

em Bad S. pneumoniae, double-crossover events IrTe*. 

While this complicates evaluation of essential genes tath«i' 
orgamsms, n provides ■ convenient aeSd fatoS 

„.l. 11B !I apprDacl1 P* 0 ™* 1 " to accelerate the process of eval- 
uatag the essentiality of genes. Smith et al {ST!? ffi 

SSSfJ*"* makssuseofa q«Hsi-random iranspSe 
S? J? 8 S nB ? e a rich array of gene taoeke^Tin a 
. .popuhtton of cells. Further transposition is shut ottand thl 

population* then grown underav^riety of tSSStoZ DNA is 
. prepared from cells m the various growth popuSX Sine 

Siu prS ,Ba 'l ,V P P ^P^tion todetermin^fl * ii 
. yield PCR products between a gene-specific primer .„h7 
transposon-specific primer. Failurf to xmdS PCT L^L 

automated fluorescence se^ncmg SSffJd 1 2™? 
^available software paScage. JSS$Sif£^SS£ 
method is the existence of a gene-to-transposon PCR prodm 
' ^t'X^+J* 0611 P°P**™ Prior to theSdW rf 
the "Planter that SnXj 
■ T V °! far n™*""*™. The efficiency fK 
method derives from the use of random transposonsto taM 

Recently, a modified version of this method, caDed "eeno- 
by in vitro transpoliS ((S5£ 
an. has been appued successfully to two bacterial soeriesTl 
In this variation of generic foowrmting, the ttS2E£f 

■Jzr^? - 01 S -P naun °^ in vitro, and the mutations were 
mtroduced into these naturally competent hwtaXiTto 

a true Xo, the focus on 10-kb DMA. segments permits near 
. saturation mutagenesis with thtmarmer family tSSoJjH 
J~rJ, which shows little or no insertion iteiSS 
authors identified four essential conserved genLof^to 
function from a total of 23 analyzed ' ^ 

Currently, the main limitation to this method is a remiire. 
ment for an efficiently transformable host bacterium 
mutahons generated in vitro can be evaluated «S? f 

ods include the following; (i) essentiality of me function of » 
gene that is duplicated or ha, afunctional pSffSJ b 
analyzed since footprinting assesses the fitness ofTs£eL 
mmgemzed gene; (i ) polarity ef ects, although not a proolS 

from baS fi 'r -f 10 ^"^retation of data JX™ 
nom bacteria; (m) the correlation of footprinting duavfth 

f^H ^°f 0Ut - dflta has not beM »nfemed P in any on£nSr 
and (w) footprinting data are technically difficult to |8E5 
or a variety of reasons, including the facts that some 

^ m tbe C-terniS Sg 

Md cbUs car3 ? in S taawttou in some 
genes dgplay an intermediate slow-growth phenotype ( °e" 
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TOOLS FOR PREDICTING THE FUNCTION ' 
OF GENE PRODUCTS 

baSK?* nW 811 ° f P rediaed functional asBigmnents 

^5°°= of closesMelated protein hTitS 

S„£ ^'Sf d ba i ed 011 te se( J usn " ^ariry to a «ne 
product of known function. In other cases, the Ihain ofrfS! 

• ^ v tte genes m bacte rial genomes either lackrie- 

putehve funcnon or functional feature such as tbTpresena of 

nSfwf ^^T 15 0f «&otoB<»B roups of ™ BS 

fKne?^^ T ^^o^ftSctional prtdiSS 
ior genes in the H jylon genome (50) 

L^L.f? 1 Predicting function of a gene product For ' 
example, a gene product, with no dgnmcanfseouencelelatiDrf 
^ aprotein of known ranctio^wh^^etyK 
TJ^t ■ , * p "! of a Po^daronic messagev witt oth« 
with the known gene products. In the R M » ^,„™r7v 7 

hm ^^? &™ W, and the J^^uTgene 
j'aiW appears to be in an operon with th* ^.JT„" ■ s D 

St^v^t* 8 P^?? 1 ^. DNA cleavage o/iSSSJ, 
Alitor OTP hydrolysis, and bindkig, to name a few The S 

S^EEW idc ^ M ^?of a?^S 6 4i; 

25 2 B f u,d . u low > ^ ^ method may be warranted ff 

ated vS* ^ PBI3S uf ,UBBBt 1156 P re3M =« of a motif 
ated with an assayable function. An attractive alteTTjSvT* V« 
focus on assays which do not require taSKerf^SrfWl^ 
funcuon of a gene product (see below) 6 Mar 

THE I FUTURE! DEALING WITH GENE TARGETS 
HAVING NO PREDICTABLE ^CnoKpSmLs 



and allele exchange methods for validatui the "tiaht? 5 
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those genes, provides both gene targets whose cellular function 
can be predicted and gene targets for which little or no func- 
tional information is available. Targets in the first class may 
be used immediately to build biochemical assays and high- 
throughput screens to detect small organic molecules which 
inhibit the biochemical activity Typically, the gene sequence is 
amplified by PCR from genomic DNA of a given bacterium, 
inserted into an expression. vector, and expressed in R coli 
sometimes with affinity tags to facilitate purification of the 
resulting protein product 

It is far less obvious how to proceed with gene targets lacking 
any functional information. This problem has attracted consid- 
erable attention in recent years because of the growing number 
of such targets known to be shared across many bacterial 
species" (24), some of which are known to be essential in at least 
one species. As a general guide, about 409fc of bacterial genes 
cannot be assigned a putative function at this time. If 10 to 
15% of these jrenes are essential, then 4 to 6% of the genes in 
a typical bacterial genome (about 1DD genes) represent po- 
tential antimicrobial targets which have never been used in 
screens. Three basic types of approaches seem feasible and 
have shared some initial success. First, cells expressing higber- 
or lower-than-normal levels of particular genes have in some 
cases been shown to be more resistant or -more sensitive, re- 
spectively, than their wild-type parents to chemical compounds 
known to inhibit those gene products. For example, -overex- 
pressioD of the yeast AL G7 gene results in cells more resistant 
than wild-Type cells to tunicarnycin (38), while reduced activity ' 
of the same gene product results in cells more sensitive to the 
drug (30). 'Sirnilarry, increased expression of the ERG11 gene 
in Candida giabrata results in higher levels of resistance to the 
• azole family of drugs which target that enzyme (54). A gene of 
.unknown function could be over expressed in a host strain, and 
the resulting assay strain could be tested for increased resis- 
tance to a library of compounds. It is clear, however, that many 
gene targets when overexpressed do not lead to resistance to 
chemical compounds that are known to bind to the protein 
product (e.g., gyrA [52]). Furthermore, overexpression of pro- 
teins often leads to lethality or growth defects (e.g., kasA [34]). 
Alternatively, a gene could be underexpressed or crippled by a 
mutation so that cells might show increased sensitivity to a 
compound which inhibits the protein product Scientists at 
MicrDcide Pharmaceuticals, Inc., have applied this approach 
on a large scale using temperature-sensitive mutants grown at 
intermediate temperatures in order to reduce the level of- ac- 
tivity of the target gene product (39a). Of course, it is not clear 
what fraction of unknown gene products would provide the. cell 
with increased drug resistance or sensitivity when over- or 
underexpressed in these ways. 

The second approach to this problem of assaying gene prod- 
ucts of unknown function is probably more generally applica- 
ble. Libraries of small molecules are screened for strong bind- 
ing affinity to proteins' of unknown function. This has. been 
achieved with peptides in phage display libraries because bind- 
ing can be readily detected by dunon of bound phage from the 
protein tethered on a solid support. Proteins of unknown func- 
tion can be produced easily as affinity fusion products for 
attachment to solid supports, and a variety of peptide phage 
display libraries are commercially available. Conformationally 
constrained disulnde-bonded peptides with affinities in the 100 
u.M to 100 nM range can be obtained by this approach (55). Of 
course, not all peptides detected by this approach will bind tD • 
sites which inhibit activity, but an elegant'new method, caHed 
'Validation in vivo of targets for ami-infectives" (V3TA), has 
been devised to identify those peptides which inhibit essential 
cellular functions (49). Potential inhibitory peptides were ex- 



pressed in a regulated manner within bacterial host cells which 
were grown either on agar medium or in an animal model of 
infection. Inhibition of cell growth or viability upon induction 
of peptide expression validated the peptide-protein interaction 
as useful for further drug development While peptides are not 
ideal drug candidates, a wider array of techniques are appli- 
cable after a moderate binder has been obtained. The peptide 
may be used as a surrogate ligand in a competition assay to 
identify a small organic comp ound with higher affinity. Scintil- 
lation proximity assays (26*) or fluorescence polarization assays 
(41) may be used in a high-throughput mode to identify com- 
pounds in chemical libraries which compete for binding with a 
labeled peptide. Alternatively, ligand binding assays may be 
configured to work directly on libraries of unlabeled chemical 
compounds. Shuker et aL (42) have described a nuclear mag- 
netic resonance-based method capable of a throughput of 
1,000 compounds per day. Mass Bpectrometric methods are 
also of interest as potentially rapid ways to detect' bound Ji- 
gands from chemical libraries. One concern about these ap- 
proaches is that proteins may have multiple accessible binding 
sites,, many of which have nothing to do with catalytic activity. 
It is not clear- at this early stage how significant an issue mul- 
tiple binding sites wfll be. However, it is worth noting that 
Shuker et ai. (42) took advantage of a Becond binding site to 
increase the affinity of an inhibitor for the protein. Ultimately,* 
of course, affinity ligands must be shown to inhibit cell growth, 
-that is, to have antimicrobial activity. Some chemical, engineer- 
ing of the compound may be required to increase microbial 
uptake. 

A third approach for assaying gene products of unknown 
function relies on the complex gene expression regulatory net- 
work found in many bacteria. Expression levels of genes in- 
metabolic pathways are often regulated in response to the 
amounts of intermediates in the cell For example, disruption 
of the general secretory pathway in R coli by mutation results 
in dramatic up-regulation of secA gene expression (37). Alksne 
et al. (2) took advantage of this fact to build a strain of £ coli 
carrying a secA~lacZ fusion as a detectable reporter. Several 
synthetic compounds and natural products were identified by 
their ability to induce expression of the reporter. Many of these 
exhibited antimicrobial activity and reduced the secretion of 
Staphylococcus aureus toxin 1. Similarly, Mdluli et aL (34) have 
reported that sublethal concentrations of isoniazdd lead to up- 
regulation of the kasA and acpM genes. This group has initi- 
ated a whole-cell, high-throughput screen of chemical com- 
pounds which induce expression of a ludferase reporter fused 
*to a gene in this regulated pathway. Screens of this type, which 
take advantage of the bacterial gene regulatory network, are 
inherently less specific than the two other types described here. 
In addition, they suffer from the basic limitation of all whole- 
cell screens: compounds must be capable of enterixag'the cell in 
order to be detected. However, these types of seres ens offer the 
potential advantage of identifying compounds which act at any 
of several points in a pathway. 

. CONCLUSIONS 

The availability of genomic sequence information for all or 
nearly all of several different bacterial species provides impor- 
tant new advantages for target discovery. First, it p emails use of 
a comparative genomic analysis to identify potential new tar- 
gets shared across several bacterial species or pa-rticular to a 
single species. In this manner, it is possible to gen srate lists of 
genes which represent potential targets for broad— spectrum or 
highly focused narrow-spectrum antibiotics. Sequs-xice compar- 
isons can also provide some assurance against mammalian 
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tooriciTy if proteins of similar sequence do not exist in mamma- 
lian sequence databases. Second, sequence similarity provides 
some insights imo putative functions for most gene products. 
Finally, availability of the entire sequence of the gene target of 
interest permits rapid construction of gene knockouts to val i- 
date the utility of the target and facile construction of espres- 
Bion plasmids for production of protein and development of 
assays. The fact that bacterial and fungal- genes can be assessed 
rapidly for their relevance as potential antibiotic targets by 
dfiffinnining the effect* of knocking our the gene and the fact 
that their genomes are small enough to be sequenced in their 
entirety are' compelling reasons that the field of genomics will 
likely find its first real utility in the development of new anti- 
microbials. 
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